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INFORMATION STORAGE MEDIUM CONTAINING SUBTITLE DATA 
FOR MULTIPLE LANGUAGES USING TEXT DATA AND 
DOWNLOADABLE FONTS AND APPARATUS THEREFOR 

5 Technical Field 

The present invention relates to an information storage medium 
on which subtitles for supporting multiple languages using text data and 
downloadable fonts are recorded and an apparatus therefor. 

10 Background Art 

Conventional digital versatile discs (DVD) use bitmap images as 
subtitles. Subtitle data of bitmap images are losslessly coded and 
recorded on a DVD, on which a maxinrium of 32 subtitles can be 
recorded. 

15 The data structure of video data on a DVD, which is one of the 

several types of conventional multimedia information storage media, will 
now be explained. 

FIG. 1 is a diagram of a data structure for a DVD. 

Referring to FIG 1 , the disc space of a DVD that is a multimedia 
20 storage medium is divided into a VMG area and a plurality of VTS areas. 
Title infomnation and information on a title menu are stored in the VMG 
area, and information on the title is stored in the plurality of VTS areas. 
The VMG area comprises 2 to 3 files and each VTS area comprises 3 to 
12 files. 

25 FIG. 2 is a detailed diagram of a VMG area. 

Referring to FIG. 2, the VMG area includes a VMGI area storing 
additional information on the VMG, a VOBS area storing video 
information (video object) on the menu, and a backup area for the VMGI. 
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These areas exist as one file and among them the presence of the 
VOBS area is optional. 

In the VTS area, information on a title, which is a reproduction unit, 
and a VOBS, which is video data, are stored. In one VTS, at least one 
s title is recorded. 

FIG. 3 is a detailed diagram of a VTS area. 

Referring to FIG. 3, a VTS area includes video title set information 
(VTSI), a VOBS that is video data for a menu screen, a VOBS that is 
video data for a video title set, and backup data of the VTSI. The 
10 presence of the VOBS for displaying a menu screen is optional. Each 
VOBS is again divided into VOBs and cells that are recording units. 
One VOB comprises a plurality of cells. The lowest recording unit 
mentioned in the present Invention is a cell. 

FIG. 4 is a detailed diagram of a VOBS that is video data. 

Referring to FIG. 4, one VOBS comprises a plurality of VOBs, and 
one VOB comprises a plurality of cells. A cell comprises a plurality of 
VOBUs. A VOBU is data coded by a moving pictures expert group 
(MPEG) method of coding moving pictures used in a DVD. According to 
the MPEG method, since images are spatiotemporal compression 
encoded, in order to decode an image, previous or following images are 
needed. Accordingly, in order to support a random access function by 
which reproduction can be started from an arbitrary location, intra 
encoding which does not need previous or following images is performed 
for every predetermined Image. This image is referred to as an infra 
picture or I picture in the MPEG and those between an I picture and the 
next I picture are referred to as a group of pictures (GOP). Usually, a 
GOP comprises 12 to 15 pictures. 
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The MPEG defines system encoding (ISO/I EC1 381 8-1) for 
encapsulating video data and audio data into one bitstream. The 
system encoding defines two multiplexing methods, including a program 
stream (PS) multiplexing method which is suitably for producing one 
5 program and storing the program in an information storage medium, and 
a transport stream multiplexing method which is appropriate for malting 
and transmitting a plurality of programs. In the methods, the DVD 
employs the PS encoding method. According to the PS encoding 
method, video data and audio data are respectively divided in the units of 
10 pacl^s (PCK) and are multiplexed through time division of the pacl^s. 
Data other than the video and audio data defined by the MPEG are 
named as a private stream and also included in PCKs so that the data 
can be multiplexed together with the audio and video data. 

A VOBU comprises a plurality of PCKs. The first PCK in the 
15 plurality of PCKs is a navigation pacic (NV^PCK). Then, the remaining 
part comprises video pacl^s (V_PCK), audio packs (A_PCK), and sub 
picture packs (SP_PCK). Video data contained in a video pack 
comprises a plurality of GOPs. 

The SP_PCK is for 2 dimensional graphic data and subtitle data. 

20 That Is, In the DVD, subtitle data that appear overiapping a video picture 
are coded by the same method as used for 2 dimensional graphic data. 
That is, for the DVD, a separate coding method for supporting multiple 
languages is not employed and after converting each subtitle data into 
graphic data, the graphic data is processed by one coding method and 

25 then recorded. The graphic data for a subtitle is referred to as a sub 
picture. A sub picture comprises a sub picture unit (SPU). A sub 
picture unit corresponds to one graphic data sheet. 

FIG. 5 is a diagram showing the relation between an SPU and 
SP PCK. 
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Referring to FIG. 5, one SPU comprises a sub picture unit [leader 
(SPUH), pixei data (PXD), and a sub picture display control sequence 
table (SP_DCSQT), which are divided and recorded in this order into a 
plurality of 2048-byte SP_PCKs. At this time, if the last data item of the 
5 SPU does not completely fill one SP_PCK, the remaining part of the last 
SP_PCK is padded to have the same size as the other SP_PCKs. 
Accordingly, one SPU comprises a plurality of SP_PCKs. 

In the SPUH, the size of the entire SPU and a location from which 
SP_DCSQT data begins are recorded. PXD data is obtained by 

10 encoding a sub picture. Pixel data forming a sub picture can have 4 
different types of values, which are a background, a pattem pixel, an 
emphasis pixeM , and an emphasis pixel-2 that can be expressed by 2 
bit values and have binary values of 00, 01, 10, and 11, respectively. 
Accordingly, a sub picture can be deemed as a set of data having the 

15 four pixel values and formed with a plurality of lines. Encoding is 
performed for each line. As shown in FIG. 6, the SPU is run-length 
encoded. That is, if 1 to 3 predetermined pixel data items continue, the 
number of continuous pixels (No_P) is expressed by 2 bits and after that, 
a 2-bit pixel data value (PD) is recorded. If 4 to 15 pixel data items 

20 continue, the first 2 bits are recorded as O's, then No_P is recorded by 
using 4 bits, and PD is recorded by using 2 bits. If 16 to 63 pixel data 
items continue, the first 4 bits are recorded as O's, then No_P is recorded 
by using 8 bits, and PD is recorded by using 2 bits. If pixel data items 
continue to the end of a line, the first 14 bits are recorded as O's, and 

25 then PD is recorded by using 2 bits. If alignment in units of bytes is not 
achieved when encoding of a line is finished, 4 bits are recorded as O's. 
The length of encoded data in one line cannot exceed 1440 bits. 

FIG 7 is a diagram of the data structure of SP_DCSQT. 

Referring to FIG. 7, SP_DCSQT contains display control 
30 information for outputting the PXD data. The SP_DCSQT comprises a 
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pluraliiy of sub picture display control sequences (SP_DCSQ). One 
SP_DCSQT Is a set of display control commands (SP_DCCMD) 
performed at one time, and comprises SP_DCSQ_STM Indicating a start 
time, SP_NXT_DCSQ_SA containing information on the location of the 
5 next SP_DCSQ, and a plurality of SP_DCCMD. 

The SP_DCCMD is control Information on how the pixel data 
(PXD) and video pictures are combined and output, and contains pixel 
data color information, information on contrast with video data, and 
infomriation on an output time and a finish time. 

10 FIG. 8 is a reference diagram showing an output situation 

considering sub picture data. 

Refening to FIG. 8, pixel data Itself is losslessly coded as PXD. 
SP_DCSQT contains Infonnatlon on an SP display area, which Is a sub 
picture display area In which a sub picture is displayed in a video display 
15 area that Is a video Image area, and Information on the start time and 
finish time of output. 

In a DVD, sub picture data for subtitle data of a maximum of 32 
different languages can be multiplexed with video data and recorded. 
Distinction of these different languages is performed by a stream Id 
20 provided by the MPEG system encoding and sub stream id defined in the 
DVD. Accordingly, if a user selects one language, SPUs are extracted 
from only SP_PCKs having stream id and sub stream id corresponding 
to the selected language, then decoded, and subtitle data are extracted. 
Then, output Is controlled according to display control commands. 

25 Many problems arise from the fact that subtitle data are 

multiplexed together with video data as described above. 

First, the amount of bits to be generated for sub picture data 
should be considered when video data are coded. That is, since 
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subtitle data is cx>nverted into graphic data and processed, the amount of 
generated data for respective languages are different from each other 
and also the amounts are huge. Usually, after encoding of moving 
pictures is performed once, sub picture data for each language Is again 
5 multiplexed being added to the output of the encoding such that a DVD 
appropriate to each region is produced. However, depending on the 
language, the amount of sub picture data is huge such that when sub 
picture data is multiplexed with video data, the entire amount of 
generated bits exceeds a maximum allowance. In addition, since sub 
10 picture data is multiplexed between video data, the start point of each 
VOBU is different according to the region. Since the start point of a 
VOBU Is separately managed, whenever a multiplexing process newly 
begins, this information should be updated. 

Secondly, since the contents of each sub picture cannot be known, 
15 sub picture data cannot be used for additional purposes, such as for 
outputting two languages at a time for a language by outputting only 
subtitle data. 

Disclosure of the Invention 

20 The present invention provides an Information storage medium on 

which sub picture data is recorded with a data structure in which when 
video data are coded, the amount of bits to be generated for sub picture 
data need not be considered in advance and an apparatus therefor. 

The present invention also provides an information storage 

25 medium on which sub picture data is recorded with a data structure in 
which sub picture data can be used for purposes other than subtitles and 
an apparatus therefor. 

Additional aspects and/or advantages of the invention will be set 
forth in part in the description which follows and, in part, will be obvious 
30 from the description, or may be learned by practice of the invention. 
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According to an aspect of the present invention, there is provided 
an infomaation storage mediunn on which video data are recorded, 
including: a plurality of dips that are recording units in which the video 
data are stored; and text data for subtitles which are recorded separately 
5 from the plurality of clips and overlappable with an image according to 
the video data and then outputtable, the text data including data for 
providing subtitles in at least one language. 

The information storage medium may include character font data, 
which are recorded separately from the plurality of clips, for graphic 
10 expression of the text data and are which are usable in the text data. 

When the text data is of multiple languages, the text data may be 
recorded In separate spaces for each of the multiple languages. 

The text data may include character data which are convertible 
Into graphic data and output synchronization information for 
15 synchronizing the graphic data with the video data. 

The text data may Include character data which are convertible 
into graphic data and output location information indicating a location in 
which the graphic data Is to be displayed when the graphic data is 
overlapped with an image according to the video data. 

20 The text data may include character data which are convertible 

Into graphic data and information for expressing the output of the graphic 
data in a plurality of sizes when the graphic data is overlapped with an 
image. 

The video data may be divided Into units that are continuously 
25 reproducible, and a size of all of the text data corresponding to one unit 
Is limited. 
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The video data may be divided into a plurality of units that are 
continuously reproducible, the text data corresponding to each 
reproducing unit being divided into a plurality of language sets, and a 
size of all of the text data forming one language set being limited. 

5 The data forming the text data may be expressed and recorded in 

Unicode for supporting multi-language character sets. 

When the text data for subtitles are formed only with characters of 
one of ASCII, which is a basic English character set, and 1808859-1, 
which is a Latin-extended character set, the text data may be coded and 
10 recorded by using UTF-8 by which one character is coded into a plurality 
of 8-bit units. 

When the text data includes a character having a code point value 
of a 2-byte size in Unicode, the text data may be coded and recorded by 
using UFT-16 by which one character is coded into a plurality of 16-bit 
15 units. 

The information storage medium may be a removable type. 

The information storage medium may be an optical disc which is 
readable by an optical apparatus of the reproducing apparatus. 

According to another aspect of the present invention, there is 
20 provided a reproducing apparatus which reproduces data from an 

information storage medium on which video data Is receded, the video 
data being coded and divided into clips that are recording units and 
recorded in a plurality of clips and on which text data for subtitles that are 
formed with data of a plurality of languages and are overlappable as 
25 graphic data with an image based on the video data, the text data being 
recorded separately from the clips, the reproducing apparatus including: 
a data reproducing unit which reads data from the Information storage 
medium; a decoder which decodes the coded video data; a renderer 
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which converts the text data into graphic data; a blender which overiays 
the graphic data with the video data to generate an image; a first buffer 
which temporarily stores the video data; and a second buffer which 
stores the text data, 

5 Font data may be stored in a third buffer and are usable in the text 

data for graphic expression of the text data and are recorded separately 
from the clips on the information storage medium, and the renderer 
converts the text data Into graphic data using the font data. 

When the text data are data of multiple languages, the text data 
10 may be recorded in separate spaces for each of the languages, wherein 
text data for a language that is one of selected by a user and set as an 
Initial reproducing language s are temporarily stored In the second buffer, 
font data for converting the text data into graphic data may be 
temporarily stored in the third buffer, and, simultaneously, while ' 
15 reproducing video data, the text data may be converted into graphic data 
and the graphic data may be output. 

The apparatus may Include a controller which controls an output 
start time and end time of the text data using synchronization information. 
On the information storage medium may be recorded the text data 
20 which includes the synchronization information, by which the text data 
are converted into graphic data which are overlapped with an Image 
based on the video data. 

The apparatus may include a controller which controls a location 
where the text data is overlapped with an Image based on the video data 
25 using output location Information. On the information storage medium 
may be recorded the text data includes character data which are 
convertible into graphic data, and the output location information 
indicating a location where the graphic data Is to be output when the 
graphic data is overlapped with an image based on the video data. 
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The video data recorded on the information storage medium may 
be divided into units that are continuously reproducible, and within a 
limited size of all of the text data corresponding to the recording unit, the 
text data are recorded. All of the text data whose size is limited may be 
5 stored in the second buffer before reproducing the continuously 
reproducible units, and when a language change occurs during 
reproduction, subtitle data corresponding to the language stored in the 
buffer may be output. 

The video data may be divided into units that are continuously 
reproducible, the text data corresponding to one unit are divided Into a 
plurality of language sets, the text data for subtitles forming the one 
language set are recorded so that all of the text data is limited. The text 
data con^esponding to a language set containing the subtitle data which 
are output simultaneously with video data, may be stored In the buffer 
before reproducing the unit that Is continuously reproducible, and when a 
language change occurs during reproduction, when the text data for the 
language are In the buffer, the text data for the language may be output, 
and when the text data for the language are not in the buffer, the text 
data corresponding to the language set containing the text data for the 
language are stored in the buffer and the text data for the language may 
be output. 

The apparatus may include a subtitle size selector which selects a 
size of the subtitle data based on a user input. The text data may 
include character data, which are convertible into graphic data, and 
25 information indicating the output of a plurality of graphic data items when 
the graphic data is overiapped with an image based on the video data 
may be recorded on the information storage medium. 

Data forming the text data may be expressed and recorded in 
Unicode for supporting multi-language sets, and the renderer converts 
30 the characters expressed in Unicode into graphic data. 

10 
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On the Information storage medium, when the text data for 
subtitles are formed only with characters of one of ASCII, which Is a 
basic English character set, and 1808859-1, which is a Latin-extended 
character set, the text data may be coded and recorded by using UTF-8 
5 by which one character is coded into a plurality of 8-bit units, and the 
renderer may convert the characters expressed by UFT-8 into graphic 
data. 

On the information storage medium, when the text data includes a 
character having a code point value of a 2-byte size in Unicode, the text 
10 data may be coded and recorded by using UFT-16 by which one 

character is coded into a plurality of 16-bit units, and the renderer may 
convert the characters expressed by UTF-16 Into graphic data. 

The information storage medium may be a removable type, and 
the reproducing apparatus may reproduce data recorded on the 
15 removable information storage medium. 

The information storage medium may be an optical disc which Is 
readable by an optical apparatus of the reproducing apparatus, and the 
reproducing apparatus may reproduce data recorded on the optical disc. 

The reproducing apparatus may output the graphic data without 
20 reproducing video data recorded on the Information storage medium. 

The subtitle data may include subtitle data for one or more 
languages and the renderer may convert text data for the one or more 
languages Into graphic data. 

The subtitle data may be synchronously overlapped with a video 
25 image and then output 

According to still another aspect of the present invention, there is 
provided A recording apparatus which records video data on an 
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information storage medium. Including: a data writer which writes data on 
the information storage medium; an encoder which codes video data; a 
subtitle generator which generates subtitle data addable to the video 
data; a central processing unit (CPU); a fixed-type storage; and a buffer. 

5 The video data is stored in the fixed-type storage after the encoder 
divides video images into clips that are recording units and compression 
encodes the clips. The subtitle generator generates subtitle data for a 
plurality of languages in the form of a text, the subtitle data being 
reproducible together with an image based on the video data and stored 

10 in the fixed-type storage. The buffer temporarily stores the data stored 
in the fixed-type storage. The data writer records the coded video data 
and subtitle data that are temporarily stored in the buffer on the 
infomriation storage medium. The CPU controls encoding of the video 
data, recording the coded video data and the subtitle data in respective 

15 separate areas on the information storage medium. 

The apparatus may include a font data generator which generates 
font data for converting text data for subtitles Into graphic data. The font 
data generator may generate font data needed for converting the subtitle 
data into graphic data, and may store the font data in the fixed-type 
20 storage. The buffer may temporarily store the font data stored in the 
fixed-type storage, the data writer may record the font data temporarily 
stored in the fixed-type storage on the information storage medium, and 
the CPU may control the generating of the font data and recording the 
font data in separate areas of the information storage medium. 

25 When the text data are data of multiple languages, the CPU may 

control the subtitle data so that the subtitle data are recorded in a 
separate space for each language. 

The apparatus may include a subtitle generator which generates 
the subtitle data by including character data which are convertible into 
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graphic data and then output and output synchronization infomiatlon for 
synchronizing with reproduction of the video images. 

The subtitie generator may generate the subtitle data by including 
character data which are convertible into graphic data and may output 
s location information Indicating a location where the graphic data will be 
output when the graphic data is overlapped with an image based on the 
video data. 

The subtitie generator may generate the text data by including 
character data which is convertible into graphic data and information for 
10 expressing the output of the graphic data with a pluralrty of sizes when 
the graphic data is overlapped with an image based on the video data. 

The coded video data may be divided into recording units that are 
continuously reproducible, and the subtitle generator may generate the 
text data so that a size of all of the subtitie data corresponding to the 
IS recording unit is limited. 

The coded video data may be divided into recording units that are 
continuously reproducible, and after the text data con-esponding to the 
recording unit are divided into a plurality of language sets, the subtitie 
generator may generate the text data so that a size of the entire subtitie 
20 data forming the one language set is limited. 

The subtitie generator may generate data forming the text data in 
Unicode for supporting multi-language character sets. 

The encoder may encode by using UTF-8 by which one character 
is coded into a plurality of 8-bit units when the text data are formed only 
25 with characters of one of ASCII, which is a basic English character set, 
and IS08859-1 , which is a Latin-extended character set. 
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The encx^der encodes by using UFT-16 by which one character is 
coded into a plurality of 1&-bit units when the text data includes a 
character having a code point value of a 2-byte size in Unicode. 

The information storage medium may be a removable type. 

5 The information storage medium may be an optical disc. 

According to yet another aspect of the present invention, there Is 
provided a method of reproducing data stored on an information storage 
medium, including: reading audio-visual (AV) data and text data; 
rendering subtitle image data from the text data; decoding the AV data 
10 and outputting decoded AV data; and blending the subtitle image data 
and the decoded AV data. 

According to still another aspect of the present invention, there is 
provided a reproducing apparatus including: a reading section which 
reads audio-visual (AV) data, text data, and font data; a decoder section 
15 which decodes the AV data and outputs moving picture data; a rendering 
section which renders subtitle image data from the text data; and a 
blending section which synthesizes the moving picture data with the 
subtitle image data. 

According to yet another aspect of the present invention, there is 
20 provided a reproducing apparatus including: a reading section which 
reads text data and font data; a rendering section which renders subtitle 
image data from the text data; and an outputting section which outputs 
the subtitle image data an input receiving section which receives an input 
to subtitle data for a next line so as to control the output time of the 
25 subtitle data. 

According to yet another aspect of the present invention, there is 
provided a data recording and/or reproducing apparatus including: a 
storage section; an encoder which codes audio-visual (AV) data to yield 
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coded AV data; a subtitle generator which generates renderable text data 
for subtitles; a data writer which writes the coded AV data and the 
renderable text data onto the storage section; a reading section which 
reads the coded AV data and the rederable text data; a decoder section 
5 which decodes the coded AV data so as to yield moving picture data; a 
rendering section which renders subtitle image data from the renderable 
text data; and a blending section which synthesizes the moving picture 
data with the subtitle image data so as to yield blended moving picture 
data. 

10 To achieve the above and/or aspects and advantages, on an 

information storage medium according to various embodiments of the 
present invention, each subtitle data Item is not coded together with AV 
data and within AV data, but is recorded in the form of separate text data 
in a separate recording space. In addition, on the infonnation storage 

15 medium, separate font data for rendering subtitle data that is In the fonm 
of text data is recorded. Also, synchronization information for 
interiocking subtitle data with AV moving pictures for which decoding 
process is finished, and output information for screen output are 
recorded. The subtitle data corresponds to sub picture data in the 

20 conventional DVD. That is, on the information storage medium 

according to various embodiments of the present invention, the following 
elements are recorded: 

1 ) AV data (clip) into which video information is compression 
encoded; 

25 2) text data for multi-language subtitles; and 

3) font data for rendering text data. 

Brief Description of the Drawings . 
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FIG, 1 is a diagram of a data structure for a DVD; 
FIG. 2 is a detailed diagram of a VMG area; 
FIG. 3 is a detailed diagram of a VTS area; 
FIG. 4 is a detailed diagram of a VOBS that is video data; 
5 FIG. 5 is a diagram showing the relation between an SPU and 

SP_PCK; 

FIG. 6 is a diagram of the data structure of a sub picture when it is 
encoded; 

FIG. 7 is a diagram of the data stnjcture of SP_DCSQT; 
10 FIG, 8 is a reference diagram showing an output situation with sub 

picture data considered; 

FIG. 9 is a block diagram of a reproducing apparatus according to 
an embodiment of the present invention; 

FIG. 10 is a diagram of the data structure of text data stored in an 
15 information storage medium according to an embodiment of the present 
invention; 

FIG. 11 is an embodiment of text data for subtitles according to an 
embodiment of the present invention; 

FIG. 12 is a diagram of the data structure of text data for a 
20 language other than the language of FIG. 1 1 ; 

FIG. 13 Is an example of a text file used in the present invention; 

FIG. 14 is an example of a subtitle to which a different style Is 
applied; 

FIG. 15 is an example of a subtitle displayed after changing a line; 
25 FIG 16 is an example showing a case where a user executes a 

language change while subtitles in a language are being reproduced; 

FIG. 17 is an example of a plurality of language sets of subtitle 
data and font data for multiple languages; 

FIG. 18 is a diagram showing correlations of PlayList, Playltem, 
30 clip information, and a clip; 
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FIG. 19 is an example of a directory structure according to the 
present Invention; 

FIG. 20 is an example showing a case where a reproducing 
apparatus outputs only subtitle data; 
5 FIG. 21 is an example showing a case where a reproducing 

apparatus outputs subtitle data for more than one language at the same 
time; 

FIG. 22 is an example showing a case where during reproduction 
of only subtitle data, normal reproduction of video data begins from video 
10 data corresponding to subtitle line data; and 

FIG. 23 Is a block diagram of a recording apparatus according to 
an embodiment of the present invention. 

Best mode for carrying out the Invention 

15 Reference will now be made in detail to embodiments of the 

present invention, examples of which are illustrated in the accompanying 
drawings, wherein like reference numerals refer to the like elements 
throughout. The embodiments are described below to explain the 
present invention by refem'ng to the figures. 

20 FIG. 9 is a block diagram of a reproducing apparatus according to 

an embodiment of the present invention. 

Referring to FIG. 9, the reproducing apparatus includes a reader 
which reads AV data, text data for subtitles, and downloaded font data 
stored in an information storage medium, a decoder for decoding AV 
25 data, a renderer which renders text files, and a blender which 

synthesizes moving pictures output from the decoder with subtitle data 
output from the renderer. 

In addition, the reproducing apparatus further includes a buffer, 
which buffers data between the reader and the decoder and renderer s 



17 



wo 2004/036574 



PCT/KR2003/002120 



and stores determined font data, and may further include a storage (not 
sliown) for storing resident font data tliat are stored in advance as 
defaults. 

As used herein, rendering encompasses all needed activities 
5 related to converting subtitle text data into graphic data so as to be 

displayed on a display apparatus. That is, rendering includes producing 
graphic data to form a subtitle image by repeating the process for finding 
a font matching with the character code of each character in the text data 
in the downloaded font data read from the information storage medium 

10 or from the residing font data, and converting the font data into graphic 
data. Rendering also includes selecting or converting colors, selecting 
or converting the size of characters, and producing graphic data 
appropriate to writing in horizontal lines or vertical lines. In particular, 
when the font data being used is an outline font, font data defines the 

15 shape of each character as a curve fomnula. In this case, rendering 
also includes a rasterizing process for generating graphic data by 
processing the curve formula. 

FIG. 10 is a diagram of the data structure of text data (i.e., subtitle 
data) stored in an information storage medium according to an 
20 embodiment of the present invention. 

Referring to FIG. 10, text data is recorded separately from AV 
streams. The text data includes synchronization infomiation, display 
area infonnation, and display style box infomnation. The 
synchronization infomiation is addable to data to be output with subtitles 

25 in a rendering process and is usable for synchronizing the subtitles with 
video Information which is decoded from AV stream data. The display 
area information designates a location on which rendered subtitle data 
are displayed on a screen. Display style box Information contains 
information on the size of characters, writing of rendered subtitle data in 

30 horizontal lines or in vertical lines, and arrangement, colors, contrast, etc., 
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in a display area. In addition, since text data for each of a plurality of 
languages may be written, the text data also contains information 
indicating a language of the plurality of languages. This so-called 
multi-language data may be stored in separate spaces for each of the 
5 respective languages, or may be stored In one space after being 
multiplexed in order of output time, 

FIG. 11 Is illustrates text data for subtitles according to an 
embodiment of the present invention. 

Referring to FIG. 11, a markup language is used as text data for 
10 subtitles in the present embodiment. Considering that the purpose of 
use is for subtitles, a minimal number of tags or elements in the markup 
language used for subtitles are used, and as described above, tags or 
attributes for synchronization and screen display may be included. Here, 
subtitle, head, meta, body, p elements are shown as examples. In the 
15 present embodiment, information is displayed with an attribute. 
Attributes used in the example are as follows: 

- start: A time at which subtitle data corresponding to moving 
pictures should be output when the start time of the moving 
pictures that should be reproduced together with the subtitle data 

20 is set to 0. A time at which subtitles are displayed is expressed 

in the form of time (HH): minute (MM): second (SS): frame (FF). 
The time can be expressed in units of 1/1000 second. Also, if 
video data is MPEG video, the time may have a presentation time 
stamp (PTS) value of video images on which the subtitle overiays 

25 and is displayed. Generally, the PTS value is a count value 

operating at 27MHz or 90kHz. If the PTS value is used, the 
subtitle data can be accurately matched with video data and 
operated. 
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- end: A time at which a displayed subtitle disappears and has the 
same type of attribute value as 'start'. 

- position: This indicates the coordinates of the top left-hand 
vertex in a video area in a display area in which subtitle data is to 

5 be displayed. 

- direction: This indicates the direction of subtitle data to be 
displayed. 

- size: This indicates the width or height of a display area in which 
subtitle data is to be displayed. If the attribute value of 

10 "direction" is "horizontal", a fixed width value of a subtitle data box 

is indicated, and if "vertical", a fixed height value of the subtitle 
data box is indicated. 

Among used elements, a subtitle element is used to indicate the 
root of text data, and a head element is used to include a meta element 

15 which deals with information needed by all of the text data, or a style 
element which is not shown in the example of FIG. 11 . In the present 
embodiment, a meta element is used to express the title of the 
corresponding text data and the language to be used. That is, when 
multiple languages are selected, by using meta information in the text 

20 data, a desired language text file can be conveniently selected. Also, 
languages can be distinguished by the names of text files, or by directory 
names, if a different directory for each language text file is prepared. 

Thus stored subtitle data is loaded into the buffer of the 
reproducing apparatus before video data is reproduced, and with the 
25 reproduction of video data, the subtitle data is converted into graphic 
data by the renderer and made to overlap video images. Accordingly, 
the subtitle data in, for example, Korean, is displayed in a display area at 
an exact time. As described above, for the text data, in addition to the 
subtitle character data, control information may also be written in a 

20 
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format or syntax. Accordingly, the Tenderer has a parser function for 
verifying that a text file to be stored is written according to a syntax. 
Also, in order to synchronize the subtitle data with video images decoded 
by the decoder by using the synchronization information included in the 
5 text file, there is a channel through which events for sending or 

determining information on the reproducing time and the reproducing 
state of the decoder are exchanged with the decoder. 

FIG. 12 is a diagram of the data structure of text data for a 
language other than the Korean language of FIG, 11 . 

10 Referring to FIG 12, when video data and text data are recorded 

in different areas, support for multiple languages is achievable by coding 
the video data separately from the subtitle data and then adding text data 
of respective different languages to the coded video data. Also, when 
subtitle data and font data that are not stored with video data on the 

15 information storage medium are downloaded through networks or loaded 
on the reproducing apparatus from an additional information storage 
medium, thus, subtitle data is easily used in other cases. 

When multiple languages are thus supported, a character code to 
be used for the text data should be determined. In an embodiment, 

20 Unicode is used. Unicode is a character code made to express 
languages throughout the world with more than 65,000 characters. 
According to the Unicode, each character is expressed by a code point in 
Unicode. Characters to express respective languages are sets of code 
points having regularly continuous values. The characters having a 

25 continuous space of code points are referred to as a code chart. Also, 
Unicode supports UTF-8, UTF-16, and UTF-32 as coding fomiats for 
actually storing or transmitting character data, that is, the code points. 
These formats are to express one character by using a plurality of data 
items with an 8-bit length, 16-bit length, and 32-bit length, respectively. 
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An ASCII code for expressing English characters and an 
IS08859-1 code for expressing languages of European countries by 
expanding Latin have code point values from 0x00 to OxFF in Unicode. 
Japanese Hirakana characters have code point values from 0x3040 to 
5 0x309F. The 11 ,172 characters for expressing modern Korean have 
code point values from OxACOO to 0XD7AF. Here, Ox indicates that the 
code point value is expressed by hexadecimal numbers. 

If subtitle data includes only English characters, the coding is 
performed by using UTF-8, For Korean or Japanese subtitle data, if 
10 UTF-8 is used, one character is expressible using 3 bytes. If UTF-18 is 
used, one character is expressible in 2 bytes but each of the English 
characters included in the subtitle data at is also expressible in 2 bytes. 

Each country has its own character code different from Unicode. 
For example, in the Korean character code set, KSC5601, a Korean 
15 character has a 2-byte code point value and an English character has 
1 -byte code point value. If the subtitle data is generated by using a 
code other than Unicode but each nation's character set, each 
reproducing apparatus understands all of these character sets such that 
the load for implementation increases. 

Font data is needed in order to process subtitle data as text data. 
Also, in order to support multiple languages, the font data supports 
multiple languages. However, it is difficult to manufacture all 
reproducing apparatuses having these fonts that support multiple 
languages. Accordingly, in this embodiment of the present invention, 
font data only for the characters used in an information storage medium 
are recorded in the information storage medium as subtitle data such 
that in a reproducing apparatus, such font data is loaded into a buffer 
before reproducing video data and then used. That is, the reproducing 
apparatus linl^s each piece of subtitle text data with font data and then 
reproduces the data. Link information of subtitle text data and font data 
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is recorded in the text data for subtlties or in a separate area. 
Considering a case where a user executes a ianguage change during 
reproduction of data, the reproducing apparatus ioads subtltie data and 
font data, which correspond to video data and is continuously 

5 reproducible before reproduction, and then uses the data. Here, 
continuous reproduction encompasses reproduction without pause, 
cessation, or interruption In the video and audio outputs of the video data. 
Generally, a reproducing apparatus reproduces data by storing an 
amount of data in a video and audio buffer and rf underflow in the buffer 

10 of the reproducing apparatus is prevented, continuous reproduction is 
possible. When subtitles or font data conresponding to video data are 
read again through the reader in order to change subtitles during 
reproduction, if underflow of the video and audio data does not occur 
during the time, loading in advance may not be needed. 

IS FIGl 1 3 is an example of a text file used in this embodiment of the 

present invention. 

Refem'ng to FIG. 13, in this embodiment of the present 
embodiment, a style element is used in a head element in order to use a 
CSS file format as an application of a style in a marl^up language for 
20 implementing a text file. By using CSS, subtitle data can use a variety 
of fonts with different sizes and colors. 

In some applications or with some users, subtitle styles that are 
set as defaults are not convenient. For example, a person with bad 
eyesight may feel inconvenience if the size of the font of the subtitle text 
25 is small. Accordingly, it is desirable to apply and display a style to 
satisfy ordinary users or persons with bad eyesight when applied to an 
identical text file. Therefore, by allowing users to determine the style, 
such as the size of a font, through a menu when reproducing an 
information storage medium in a first reproducing apparatus, a style 
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sheet which is for applying a style according to a user's settings and has 
a plurality of options that are selectable by the user can be used. 

In the present Invention, an ©user rule by which a subtitle style 
according to a user Is settable will now be explained. User type is a set 
5 of CSS attributes. In the present embodiment, a detailed distinction of 
user types, that Is, the degree of bad eyesight, is not relevant, and 
therefore, only the two following cases as follows will be explained: 

- small: a style for a user with normal eyesight; and 

- large: a style for a user with bad eyesight 

10 As shown in FIG. 14, subtitles which are preset by using an @user 

rule or to which different styles are applied for users with good eyesight 
or with bad eyesight can be displayed. 

It is also possible for a reproducing apparatus to output subtitles 
with applying a different position and size according to the user's 
15 preference without using the position and size determined by the subtitle 
data. 

FIG. 1 5 is an example in which the text data for the Korean 
subtitles implemented in FIG. 11 are displayed on an actual screen. 

Referring to FIG. 16, since in the screen expressed by the second 
20 <p> element, the width value of the subtitle data display area is fixed to 
520 by the "size" attribute, subtitle data that cannot be expressed within 
one line is displayed after changing a line. Alternatively, subtitle data is 
outputtable only In a display area and by using a line change element 
(br), line change can be selected forcibly. 

25 The third <p> element is an example in which by a "direction" 

attribute, the display of subtitle data is vertically performed. 
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FIG. 16 Is an example showing a case where a user executes a 
language change while subtitles In a language are being reproduced. 

Referring to FtQ 16, when a language change is needed, a 
reproducing apparatus changes subtitle text data being reproduced (in 
5 Korean, for example), links font data corresponding to text data, renders 
data of the changed language (English, for example), and by doing so, 
outputs the subtitles. If data for subtitles and font data for this are all 
loaded in the buffer, continuous reproduction of video data can be easily 
performed. If text data or font data desired to be changed is not loaded 
10 in the buffer, the data should be loaded into the buffer. At this time, a 
pause, cessation, or interruption can occur in reproduction of video data. 

For multi-language conversion without pause, cessation, or 
interruption of video reproduction, the sizes of data for subtitles and font 
data are limitable to less than the sizes of the respective buffers. In this 
15 case, however, the number of supported languages is restricted. 
Accordingly, in the present embodiment of the present invention, this 
problem is solved by creating a unit referred to as a language set. 

FIG. 17 Is an example of a plurality of language sets of subtitle 
data and font data for multiple languages. 

20 Referring to FIG. 17, subtitle data and font data for a plurality of 

languages added to one video image are divided into a plurality of 
language sets. Subtitle data and font data that correspond to one 
language set are limited to a size that is less than the size of the buffer. 
After a language set containing subtitle data of a language selected by a 

25 user or selected as a default by the reproducing apparatus is loaded in 
the buffer before reproducing video data, reproducing video data begins. 
When the user executes a language change, the language change with 
the subtitle data Included in this language set can be done without cease 
because the data is already loaded in the buffer However, if a change 
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to a language not Included in this language set is made, the reproducing 
apparatus loads again the subtitle data and font data of the desired 
language set. In this case, data of the existing language set is all 
deleted. At this time, in reproducing video data, a pause, cessation, or 
5 intenuption may occur. Thereafter, if a language change is performed, 
a language change operation Is performed again according to the 
relation between the language and the language set loaded in the buffer, 
information on the language set is recordable on an information storage 
medium or by considering the data stored in an information storage 
10 medium and the size of the buffer in the reproducing apparatus, and the 
reproducing apparatus determines this arbitrarily when reproducing data. 

The relation between information needed in reproducing video 
data and the subtitle data will now be explained with an embodiment. 

As used herein, a clip is a recording unit of video data, and 
15 PlayList and Play Item will be used to indicate reproducing units. 

In an information storage medium according to an embodiment of 
the present invention, AV streams are separated and recorded In units of 
clips. Usually, a clip is recorded in a continuous space. In order to 
reduce the volume, AV streams are compressed and recorded. 

20 Accordingly, in order to reproduce the compressed AV streams, attribute 
information of the compressed video data should be informed. 
Therefore, Clip information is recorded in each clip. Clip information 
contains audio video attributes of the clip and an Entry Point Map in 
which information on the location of an Entry Point where random access 

25 is available in each interval is recorded. In an MPEG, which is widely 
used as a video compression technology, the Entry Point is the location 
of I picture where an intra image is compressed, and the Entry Point Map 
is mainly used for a time search used to find a point in a time interval 
after the starting point of reproduction. 
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PlayList is a basic unit of reproduction. In an information storage 
medium according to the present embodiment, a plurality of PlayLists is 
stored. One PlayList includes a series of a plurality of Playltems. 
Playltem corresponds to a part of a clip, and more specifically, it is used 
5 in the form by which a reproduction start time and end time in the clip are 
determined. Accordingly, by using Clip information, the location of the 
part in an actual clip corresponding to the Playltem is identified. 

FIG. 18 is a diagram showing correlations of a PlayList, a Playltem, 
Clip Information, and a clip. 

10 Refen-ing to FIG. 18, in addition to a PlayList, a Playltem, Clip 

information, and a clip, in the present embodiment of the present 
invention, a plurality of text data items for subtitles for each clip are 
recorded in a space separate from the clip. A plurality of data items for 
subtitles are linked to one clip and this link information is recordable in 

15 the Clip information. To some clips, a plurality of data Items for subtitles 
are linked, but for some clips, no data items or only one data item for 
subtitles may be linked. When PlayList is reproduced, Playltems 
included in the PlayList are sequentially reproduced. As a result, any 
one of the clips linked to each Playltem and a plurality of subtitles linked 

20 to the clip are rendered and output Since continuous reproduction 
between PlayLists is usually not guaranteed, all linked text data for 
subtitles is loadable into a buffer before reproducing the PlayList. In FIG. 
18, font data Is not separately marked. 

Usually, font data is generated for each language. Accordingly, 
25 font data is recorded in a separate space for each language. 

FIG. 19 is an example of a directory structure according to an 
embodiment of the present invention. 

Referring to FIG. 19, in a directory, clip. Clip information, a PlayList, 
subtitle text data, and font data are stored in the form of files and stored 
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in different directory spaces according to the respective types. As 
shown, text files for subtitles and font files are storable in directory 
spaces separate from video data. 

An information storage medium according to various 
5 embodiments of the present invention is a removable information storage 
medium (i.e., one which is not fixed to a reproducing apparatus and, only 
when data is reproduced, can be placed and used). Unlike a fixed 
information storage medium with a high capacity such as a hard disc, the 
removable information storage medium has a limited capacity. Also, 

10 reproducing apparatuses for reproducing this medium often have a buffer 
with a limited size and low level function s with limited performance. 
Accordingly, together with video data recorded on a removable 
information storage medium, only subtitle data and font data used for the 
subtitle data are recorded on the information storage medium and by 

15 using the data when video data is reproduced from the information 

storage medium, the amount of data that should be prepared in advance 
can be minimized. A representative example of this removable 
recording medium is an optical disc. 

On an information storage medium according to an embodiment 
20 of the present invention, video data is stored In a space separate from 
subtitle text data. If this subtitle text data is for multiple languages and 
has font data for outputtihg the subtitle data, a reproducing apparatus 
loads only the subtitle data and font data in the buffer and then, while 
reproducing video data, overlaps the subtitle data with a video Image and 
25 outputs the subtitle data. 

FIG. 20 is an example showing a case where a reproducing 
apparatus outputs only subtitle data. 

Referring to FIG 20, a reproducing apparatus according to an 
embodiment of the present invention may output only subtitle data. 



28 



wo 2004/036574 PCT/KR2003/002120 



That is, according to one of the many special reproduction functions, 
video data is not reproduced, and only subtitle data that is to be output 
overlapping the video data is converted into graphic data and then output. 
In this case, subtitle data may be used, for example, for learning a 
5 foreign language. Here, video data is not overlapped and only subtitle 
data is output Also, both the synchronization information and location 
information are neglected or not included, and the reproducing apparatus 
outputs a plurality of line data items including subtitle data on the entire 
screen, and waits for a user input. After watching all of the output 
10 subtitle data, the user sends a signal for displaying subtitle data for the 
next line to the reproducing apparatus so as to control the output time of 
the subtitle data. 

FIG. 21 is an example showing a case where a reproducing 
apparatus outputs subtitle data for more than one language at the same 
15 time. 

Referring to FIG. 21, as an embodiment, a reproducing apparatus 
may have a function for outputting subtitle data for two or more 
languages at the same time when subtitle data includes a plurality of 
languages. At this time, by using synchronization information of subtitle 
20 data for each language, subtitle data to be displayed on the screen is 
selected. That is, subtitle data is output in order of output start time, 
and when the output start times are the same, the subtitle data is output 
according to language. 

A function, by which while only subtitle data are reproduced, 
25 normal reproduction of video data can be started from the video data 
corresponding to a subtitle line data item, is also implementable. 

FIG. 22 is an example showing a case where during reproduction 
of only subtitle data, normal reproduction of video data begins from video 
data conresponding to subtitle line data. 



29 



wo 2004/036574 PCT/KR2003/002120 



As shown in FIG. 22, when the user selects one subtitle line data 
item, a reproducing time corresponding to the line data item is selected 
again, and video data corresponding to the time is normally reproduced. 

A recording apparatus according to an embodiment of the present 
5 invention records video data and subtitle data on an information storage 
medium. 

FIG. 23 is a block diagram of a recording apparatus according to 
an embodiment of the present invention. 

Referring to FIG 23, the recording apparatus includes a central 
10 processing unit (CPU), a fixed high-capacity storage, an encoder, a 
subtitle generator, a font generator, a writer, and a buffer. 

The encoder, subtitle generator, and font generator may be 
implemented by software on the CPU. 

In addition, a video input unit for receiving video data in real time 
15 is also includable. 

The storage stores a video image that is the object of encoding, or 
video data that is coded by the encoder. In addition, the storage stores 
a dialogue attached to the video data and large volume font data. The 
subtitle generator receives information on the output time of a subtitle 

20 line data item from the encoder, receives subtitle line data from the 
dialogue data, makes subtitle data for the subtitles, and stores the 
subtitle data in a fixed-type storage apparatus. The font generator 
generates font data containing characters used in the subtitle data for 
subtitles from the large volume font data and stores the font data in the 

25 fixed-type storage apparatus. That is, the font data stored in the 

information storage medium is part of the large volume font data stored 
in the fixed-type storage apparatus. This process for generating data In 
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the form to be stored in an information storage medium is referred to as 
authoring. 

If the authoring process is finished, coded video data stored in the 
fixed-type storage apparatus are divided into dips, which are the 
5 recording units, and recorded on an information storage medium. Also, 
subtitle data for subtitles added to video data contained in the clip are 
recorded in a separate area. Further, font data needed to convert the 
subtitle data into graphic data is recorded in a separate area. 

The video data is divided into reproducing units that are 
10 continuously reproducible, and usually, this reproducing unit includes a 
plurality of clips. As an embodiment, the size of subtitle data, which are 
overlappable with a video Image included in one reproducing unit and is 
output, is limited to be less than a size when the data for a plurality of 
languages is all added to the subtitle data. Alternatively, subtitle data, 
15 which should be overiapped with a video image included in one 

reproducing unit, is divided into language sets with which a language 
change is continuously perFormable when video data is reproduced. 
Subtitle data included in one reproducing unit includes a plurality of 
language sets and the size of subtitle data included in one language set, 
20 plus data for a plurality of languages, is limited to less than a size. 

The subtitle data includes character codes using Unicode and the 
data form actually recorded Is codable by UTF-8 or UTF-16. 

Video data, subtitle data for subtitles, and font data recorded In 
the fixed-type storage apparatus are temporarily stored in the buffer and 
25 are recorded on an information storage medium by the writer. The CPU 
executes a software program controlling each device so that these 
functions are performed in order. . 

As described above, according to the above-described 
embodiments of the present invention, text data for multi-language 
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subtitles are made to be a text file and then recorded in a space separate 
from AV streams such that more diverse subtitle are providable to users 
and a recording space arrangement is conveniently performable. 

Font data for this are made to have a minimum size by collecting 
5 characters needed for the subtitle text and are stored separately in an 
infomiation storage medium and used. 

Although a few embodiments of the present invention have been 
shown and described, the present invention is not limited to the disclosed 
embodiments. Rather, it would be appreciated by those skilled in the art 
10 that changes may be made in this embodiment without departing from . 
the principles and spirit of the invention, the scope of which is defined in 
the claims and their equivalents. 



Industrial Applicabilitv 

15 The present invention is applicable to fields related to recording 

and reproduction of moving pictures, particularly in fields in which text 
data of multiple languages must be provided while reproducing moving 
pictures. 
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What is claimed Is: 

1 . An Information storage medium on which video data are 
recorded, comprising: 

a plurality of clips that are recording units in which the video data 
5 are stored; and 

text data for subtitles which are recorded separately from the 
plurality of clips and overlappable with an image according to the video 
data and then outputtable, the text data including data for providing 
subtitles in at least one language. 

10 

2. The information storage medium of claim 1 , further 
comprising character font data, which are recorded separately from the 
plurality of clips, for graphic expression of the text data and which are 
usable in the text data. 

15 

3. The information storage medium of claim 1 , wherein, when 
the text data is of multiple languages, the text data are recorded in 
separate spaces for each of the multiple languages. 

20 4. The information storage medium of claim 1 , wherein the 

text data includes character data which are convertible into graphic data 
and output synchronization information for synchronizing the graphic 
data with the video data. 

25 5, The information storage medium of claim 1 , wherein the 

text data includes character data which are convertible into graphic data 
and output location information indicating a location in which the graphic 
data is to be displayed when the graphic data is overlapped with an 
image according to the video data. 
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6. The Information storage medium of claim 1 , wherein the 
text data includes character data which are convertible into graphic data 
and information for expressing the output of the graphic data in a plurality 
of sizes when the graphic data is overlapped with an image. 

5 

7, The information storage medium of claim 1 , wherein the 
video data are divided Into units that are continuously reproducible, and 
a size of all of the text data conresponding to one unit is limited. 

10 . 8. The information storage medium of claim 1 , wherein the 
video data is divided into a plurality of units that are continuously 
reproducible, the text data conresponding to each reproducing unit being 
divided Into a plurality of language sets, and a size of all of the text data 
forming one language set being limited. 

15 

9. The information storage medium of claim 1, wherein the 
data fomiing the text data are expressed and recorded in Unicode for 
supporting multi-language character sets. 

20 10. The Information storage medium of claim 9, wherein, when 

the text data for subtitles are formed only with characters of one of ASCII, 
which Is a basic English character set, and IS08859-1 , which is a 
Latin-extended character set, the text data being coded and recorded by 
using UTF-8 by which one character is coded Into a plurality of 8-blt 

25 units. 



11 , The information storage medium of claim 9, wherein, when 
the text data includes a character having a code point value of a 2-byte 
size in Unicode, the text data being coded and recorded by using UFT-16 
30 by which one character Is coded into a plurality of 16-blt units. 
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12. The Information storage medium of claim 1 , wherein the 
Information storage medium Is a removable type. 

13. The information storage medium of claim 12, wherein the 
5 information storage medium is an optical disc which is readable by an 

optical apparatus of the reproducing apparatus. 

14. A reproducing apparatus which reproduces data from an 
information storage medium on which video data Is receded, the video 

10 data being coded and divided Into dips that are recording units and 

recorded in a plurality of clips and on which text data for subtitles that are 
formed with data of a plurality of languages and are overlappable as 
graphic data with an image based on the video data, the text data being 
recorded separately from the clips, the reproducing apparatus 
15 comprising: . 

a data reproducing unit which reads data from the information 
storage medium; 

a decoder which decodes the coded video data; 
a renderer which converts the text data Into graphic data; 
20 a blender which overlays the graphic data with the video data to 

generate an Image; 

a first buffer which temporarily stores the video data; and 
a second buffer which stores the text data. 

25 15. The reproducing apparatus of claim 1 4, wherein font data 

are stored in a third buffer and are usable in the text data for graphic 
expression of the text data and are recorded separately from the clips on 
the information storage medium, and the renderer converts the text data 
Into graphic data using the font data. 

30 
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16. The reproducing apparatus of claim 14, wherein, when the 
text data are data of multiple languages, the text data are recorded In 
separate spaces for each of the languages, wherein text data for a 
language that is one of selected by a user and set as an initial 

5 reproducing language s are temporarily stored in the second buffer, 
wherein font data for converting the text data into graphic data are 
temporarily stored in the third buffer, and wherein, simultaneously, while 
reproducing video data, the text data is converted into graphic data and 
the graphic data is output 

10 

17. The reproducing apparatus of claim 14, further comprising 
a controller which controls an output start time and end time of the text 
data using synchronization information, 

wherein on the information storage medium are recorded the text data 
15 which includes the synchronization information, by which the text data 
are converted into graphic data which are overlapped with an image 
based on the video data. 

1 8. The reproducing apparatus of claim 14, further comprising 
20 a controller which controls a location where the text data Is overlapped 

with an image based on the video data using output location information, 
wherein on the information storage medium are recorded the text data 
includes character data which are convertible into graphic data, and the 
output location information Indicating a location where the graphic data is 
25 to be output when the graphic data is overlapped with an Image based 
on the video data. 
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1 9. The reproducing apparatus of claim 1 6, wherein the video 
data recorded on the information storage medium are divided Into units 
that are continuously reproducible, and within a limited size of all of the 
text data corresponding to the recording unit, the text data are recorded, 
5 wherein all of the text data whose size is limited is stored in the second 
buffer before reproducing the continuously reproducible units, and when 
a language change occurs during reproduction, subtitle data 
corresponding to the language stored in the buffer is output. 

10 20. The reproducing apparatus of claim 1 6, wherein the video 

data is divided Into units that are continuously reproducible, the text data 
corresponding to one unit are divided into a plurality of language sets, 
the text data for subtitles forming the one language set are recorded so 
that all of the text data is limited, wherein the text data corresponding to 

15 a language set containing the subtitle data which are output 
simultaneously with video data, are stored In the buffer before 
reproducing the unit that is continuously reproducible, and when a 
language change occurs during reproduction, when the text data for the 
language are in the buffer, the text data for the language are output, and 

20 when the text data for the language are not in the buffer, the text data 
corresponding to the language set containing the text data for the 
language are stored in the buffer and the text data for the language are 
output. 

25 21 . The reproducing apparatus of claim 14, further comprising 

a subtitle size selector which selects a size of the subtitle data based on 
a user input, wherein the text data includes character data, which are 
convertible into graphic data, and information indicating the output of a 
plurality of graphic data items when the graphic data is overiapped with 

30 an image based on the video data are recorded on the information 
storage medium. 
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22. The reproducing apparatus of claim 14, wherein on the 
information storage medium, data forming the text data is expressed and 
recorded in Unicode for supporting muiti-ianguage sets, and the renderer 

5 converts the characters expressed in Unicode into graphic data. 

23. The reproducing apparatus of claim 22, wherein on the 
information storage medium, when the text data for subtitles are formed 
only with characters of one of ASCII, which is a basic English character 

10 set, and IS08859-1 , which is a Latin-extended character set, the text 
data being coded and recorded by using UTF-8 by which one character 
is coded into a plurality of 8-bit units, and the renderer converts the 
characters expressed by UFT-8 into graphic data. 

15 24. The reproducing apparatus of claim 22, wherein on the 

information storage medium, when the text data includes a character 
having a code point value of a 2-byte size in Unicode, the text data are 
coded and recorded by using UFT-16 by which one character is coded 
into a plurality of 16-bit units, and the renderer converts the characters 

20 expressed by UTF-16 Into graphic data. 

25. The reproducing apparatus of claim 14, wherein the 
information storage medium is a removable type, and the reproducing 
apparatus reads data from and reproduces data on the removable 

25 information storage medium. 

26. The reproducing apparatus of claim 25, wherein the 
information storage medium is an optical disc which is readable by an 
optical apparatus of the reproducing apparatus, and the reproducing 

30 apparatus reads and reproduces data recorded on the optical disc. 
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27. The reproducing apparatus of claim 14, wlierein the 
reproducing apparatus outputs the graphic data without reproducing 
video data recorded on the information storage medium. 

5 28. The reproducing apparatus of claim 27, wherein the subtitle 

data further comprises subtitle data for one or more languages, the 
renderer converts text data for the one or more languages into graphic 
data. 

10 29. The reproducing apparatus of claim 27, wherein the subtitle 

data are synchronously overlapped with a video image and then output. 

30. A recording apparatus which records video data on an 
information storage medium, comprising: 
15 a data writer which writes data on the information storage 

medium; 

an encoder which codes video data; 

a subtitle generator which generates subtitle data addable to the 
video data; 
20 a central processing unit (CPU); 

a fixed-type storage; and 
a buffer, 

wherein the video data is stored In the fixed-type storage after the 
encoder divides video Images into clips that are recording units and 
25 compression encodes the clips, 

wherein the subtitle generator generates subtitle data for a 
plurality of languages in the form of a text, the subtitle data being 
reproducible together with an image based on the video data and stored 
in the fixed-type storage, 
30 wherein the buffer temporarily stores the data stored in the 

fixed-type storage, 
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wherein the data writer records the coded video data and subtitie 
data that are temporarily stored in the buifer on the information storage 
medium, and 

wherein the CPU controls encoding of the video data, recording 
5 the coded video data and the subtitle data in respective separate areas 
on the information storage medium. 

31 . The recording apparatus of claim 30, further comprising a 
font data generator which generates font data for converiiing text data for 

10 subtitles into graphic data, 

wherein the font data generator generates font data needed for 
converting the subtitle data into graphic data, and stores the font data in 
the fixed-type storage, the buffer temporarily stores the font data stored 
in the fixed-type storage, the data writer records the font data temporarily 

15 stored in the fixed-type storage on the information storage medium, and 
the CPU controls the generating of the font data and recording the font 
data in separate areas of the information storage medium. 

32. The recording apparatus of claim 30, wherein, when the 
20 text data are data of multiple languages, the CPU controls the subtitle 

data so that the subtitle data are recorded in a separate space for each 
language. 

33. The recording apparatus of claim 30, wherein the subtitle 
25 generator generates the subtitle data by including character data which 

are convertible into graphic data and then output and output 
synchronization information for synchronizing with reproduction of the 
video images. 
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34. The recording apparatus of claim 30, wherein the subtitle 
generator generates the subtitle data by including character data which 
are convertible into graphic data and output location information 
indicating a location where the graphic data will be output when the 

5 graphic data is overlapped with an Image based on the video data. 

35. The recording apparatus of claim 30, wherein the subtitle 
generator generates the text data by including character data which is 
convertible into graphic data and information for expressing the output of 

10 the graphic data with a plurality of sizes when the graphic data is 
overlapped with an image based on the video data. 



36. The recording apparatus of claim 30, wherein the coded 
video data is divided into recording units that are continuously 

15 reproducible, and the subtitle generator generates the text data so that a 
size of all of the subtitle data corresponding to the recording unit is 
limited. 

37. The recording apparatus of claim 30, wherein the coded 
20 video data is divided into recording units that are continuously 

reproducible, and after the text data corresponding to the recording unit 
are divided into a plurality of language sets, the subtitle generator 
generates the text data so that a size of the entire subtitle data forming 
the one language set is limited. 



25 



38. The recording apparatus of claim 30, wherein the subtitle 
generator generates data forming the text data in Unicode for supporting 
multi-language character sets. 
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39. The recording apparatus of claim 37, wherein the perfomis 
encoding] encoder encodes by using UTF-8 by which one character Is 
coded into a plurality of 8-bit units when the text data are formed only 
with characters of one of ASCII, which is a basic English character set, 

5 and IS08859-1, which is a Latin-extended character set. 

40. The recording apparatus of claim 37, wherein the performs 
encoding] encoder encodes by using UFT-16 by which one character is 
coded into a plurality of 16-bit units when the text data includes a 

10 character having a code point value of a 2-byte size in Unicode. 

41 . The recording apparatus of claim 30, wherein the 
information storage medium is a removable type. 

15 42. The recording apparatus of claim 41 , wherein the 

information storage medium is an optical disc. 

43. A method of reproducing data stored on an infomnation 
storage medium, comprising: 
20 reading audio-visual (AV) data and text data; 

rendering subtitle image data from the text data; 
decoding the AV data and outputting decoded AV data; and 
blending the subtitle image data and the decoded AV data. 

25 44. The method of claim 43, further comprising synchronizing 

the subtitle image data with the video information using synchronization 
infomriation included in the decoded AV data. 

45. The method of claim 43, wherein the text data includes 
30 display area information, display style box information, and information 
Indicating a language. 
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46. The method of claim 45, Wherein the display area 
infomiation designates a location on which rendered subtitle image data 
are displayed on a screen. 

5 

47. The method of claim 45, wherein the display style box 
information contains information regarding at lest one of a size of 
displayed characters, a writing of rendered subtitle image data in 
horizontal lines and/or vertical lines, an arrangement, colors, and a 

10 contrast in the display area. 

48. The method of claim 43, further comprising loading the 
subtitle data into a buffer. 

15 49. The reproduction apparatus of claim 43, wherein the AV 

data, the text data, and the font data are stored in an information storage 
medium readable by the reading section. 

50. The reproduction apparatus of claim 43, wherein the 
20 blending yields a video Image having subtitles displayed thereon. 

51 . A reproducing apparatus comprising: 

a reading section which reads audio-visual (AV) data, text data, 
and font data; 

25 a decoder section which decodes the AV data and outputs moving 

picture data; 

a rendering section which renders subtitle image data from the 
text data; and 

a blending section which synthesizes the moving picture data with 
30 the subtitle image data. 
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52. The reproduction apparatus of claim 51 , further comprising 
a buffer section which buffers data between the reading section and the 
decoding and rendering sections. 

5 53, The reproduction apparatus of claim 51 , wherein the AV 

data, the text data, and the font data are stored in an information storage 
medium readable by the reading section. 

54. The reproduction apparatus of claim 51 , wherein at least 
10 one of the text data and the font data are stored in a downloadable 

database. 

55. The reproduction apparatus of claim 51 , wherein the AV 
data is stored in an information storage medium readable by the reading 

15 section. 

56. The reproduction apparatus of claim 51 , wherein the 
rendering section finds a font matching a character code of each 
character in the text data, the fonts being stored in one of a 

20 downloadable database and in a storage section of the apparatus. 

57. The reproduction apparatus of claim 51 , wherein the text 
file includes data for each of one or more languages and the text data 
contains information indicating one of the one or more languages. 

25 

58. The reproduction apparatus of claim 51 , wherein, when the 
text file includes data for each of one or more languages, the text data is 
one of stored as multiplexed data in an area and stored in separate 
areas for each of the one or more languages. 

30 

59. A reproducing apparatus comprising: 
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a reading section wliich reads text data and font data; 
a rendering section which renders subtitle image data from the 
text data; 

an outputting section which outputs the subtitle image data; and 
5 an input receiving section which receives an input to subtitle data 

for a next line so as to control the output time of the subtitle data. 

60. The apparatus of claim 59, wherein the text data and the 
font data are for a plurality of languages, the subtitle information includes 

10 synchronization information for each of the plural languages, and subtitie 
data two or more of the plurality of languages are output at the same 
time. 

61 . A data recording and/or reproducing apparatus comprising: 
15 a storage section; 

an encoder which codes audio-visual (AV) data to yield coded AV 

data; 

a subtitie generator which generates renderable text data for 
subtities; 

20 a data writer which writes the coded AV data and the renderable 

text data onto the storage section; 

a reading section which reads the coded AV data and the 
rederable text data; 

a decoder section which decodes the coded AV data so as to yield 
25 moving picture data; 

a rendering section which renders subtitie Image data from the 
renderable text data; and 

a blending section which synthesizes the moving picture data with the 
subtitie image data so as to yield blended moving picture data. 
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FIG. 13 
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